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Abstract 


Two  methods  for  obtaining  sequential  test  based  confidence 
intervals  for  the  mean  of  a normal  distribution  are  proposed  and 
compared.  One  method  involves  reducing  the  bias  Introduced  into  the 
sample  mean  by  optional  stopping.  The  other  defines  the  confidence 
interval  directly  in  terms  of  the  stopping  time  of  the  test. 


AMS  Subject  Classification.  Primary  62L10 


Key  Words  and  Phrases:  Sequential  test,  confidence  Interval, 

stopping  rule. 
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estimation  following  sequential  tests 


1 Introduction 

In  decision  oriented  liypothesis  testing  sequential  analysis 
olfeis  an  advantage  over  fixed  sample  size  procedures.  In  cases  which 
are  not  close  to  the  borderline  between  hypotheses,  one  can  make  a 
decision  on  the  basis  of  a relatively  small  amount  of  data  and  reduce 
the  cost  of  experimentation.  It  is  commonly  supposed  that  sequential 
tests  are  not  so  useful  when  one  wants  to  supplement  his  test  results 
with  an  estimate  of  some  parameter.  The  basis  for  this  belief  is  the 
observation  that  sequential  tests  are  designed  to  terminate  sampling  as 
soon  as  there  is  sufficient  reason  to  reject  one  hypothesis,  and  it 
Irequently  occurs  that  insufficient  data  is  then  available  for  accurate 
estimation.  A second  reason  for  the  paucity  of  studies  of  estimation 
following  sequential  tests  is  the  mathematical  difficulty  associated 
with  computing  the  sampling  distributions  of  such  estimates. 

The  primary  purpose  ot  this  paper  is  to  suggest  point  and 
confidence  Interval  estimates  tor  use  in  conlunction  with  the  class  ol 
sequential  tests  recommended  by  Armltage  (1975)  for  clinical  trials. 

Two  approaches  are  considered.  One  begins  with  the  customary  fixed 
sample  size  estimator  which  is  then  modified  to  reduce  the  bias  created 
by  random  termination  of  the  experiment.  The  other  and  more  novel 
approach  attempts  to  quantify  one's  intuitive  feeling  that  the  sooner  a 
sequential  test  terminates  the  more  evidence  it  provides  for  a large 
departure  from  the  boundary  between  hypotheses.  To  facilitate 
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understanding  the  proposed  procedures  most  of  the  paper  Is  restricted 
tc  estimating  the  mean  cf  a normal  population  with  known  variance.  The 
more  practical  and  difficult  cases  of  a normal  mean  with  unknown  vari- 
ance and  Bernoulli  mean  p are  discussed  briefly. 

In  clinical  trials  the  desirability  of  sequential  tests  arises 
tc  a large  degree  from  ethical  considerations.  For  example,  if  one  is 
comparing  two  treatments  in  a matched  pairs  experiment,  where  In  each 
pair  one  person  Is  assigned  at  random  to  each  treatment,  and  if  one 
treatment  is  considerably  superior,  the  experiment  should  be  terminated 
as  soon  as  possible  sc  that  only  a small  number  of  patients  are 
subjected  to  the  Inferior  treatment.  To  be  more  specific,  suppose  that 

for  n “ 1,2,...  the  difference  in  response  in  the  rr^  pair,  x^,  is 

2 

normally  distributed  with  unknown  mean  M and  known  variance  a 

independently  of  the  differences  in  the  other  pairs.  Let 

s • X,  + . . . + X , and  for  given  b > 0 define  the  stopping  rule 
n 1 n 

T « first  n ^ 1 such  that  js^j  CTb»^  . (1) 

Let  m be  a positive  Integer.  Consider  the  sequential  test  of  H^:  p « 0 
against  H^:  p ^ 0 which  terminates  sampling  at  T'  • min(T,m)  and 
rejects  If  and  only  If  T £ m. 

Let  6 - p/0.  Tne  distribution  of  T and  hence  the  power  func- 
tion and  expected  sample  size  of  these  tests  depend  on  the  parameters 
(p,0)  only  through  the  value  of  6.  The  tests  have  been  studied  numer- 
ically by  McPherson  and  Armltage  (1971) — see  also  Armltage  (1975) — 
whose  tables  allow  one  to  choose  the  design  parameters  b and  m to 
attain  a given  significance  level  ci  • PQ{T^m}  and  type  II  error 
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probability  B = Ppl.T>m}  at  a iniven  value  f 0.  Thov  have  beeri 

stiidted  asymptotical  Iv  tor  lar^'c  b and  m by  SieRmund  ( 1977),  whose 

approximations  compare  favorably  with  the  numerical  results  ot 

McPherson  and  Armitape,  and  whose  methods  form  the  basis  tor  this 

paper.  Technical  aspects  of  the  asvmntotic  analysis  use  ideas  which  po 

back  to  Anscombe  (1953)  and  have  recently  also  been  studied  bv 

Wocdroofe  (197bb)  in  a context  related  to  the  present  one,. 

The  followinp  heuristic  considerations  suppest  that  an 

experiment  terminated  by  the  stoopinp  rule  T may  be  useful  in  some 

estimation  problems,  and  hence  the  poals  of  estimation  and  Lestinp  may 

not  be  in  conflict  to  the  deprec  commonly  supposed. 

The  estimator  x = s /n  of  p based  on  x,,...,x  has  variance 
n n In 

2 

0 /n.  To  estimate  u (assumed  # 0)  with  a prescribed  proportional 

— "'ll 

accuracy,  one  would  like  to  choose  n so  that  n o ^ b|M|»  where  b is  a 
measure  of  the  accuracy  desired.  This  is,  of  course,  impossible 
because  the  inequality  deflnlnp  n depends  on  tlie  unknown  value  of  ImIi 
but  it  one  is  free  to  itioose  his  sample  si/e  sequentially,  it  suppests 
estimating  |p|  by  |x^|  and  stopping  with  the  first  n 1 sui.  h that 

_L  ' _ ■“ 

n ^0  < blx  I.  This  stonpinp  ru^e  is  .t  the  to cm  (1)  with  b « b 
— ' n ' 

Truncatinp  the  experiment  at  the  observation  has  the  inter nretat ion 

of  accepting  an  estimate  ot  prescribed  absolute  accuracy  when  jul  appear 
to  be  so  small  that  the  prescribed  proportional  accuracy  requirement 
would  necessitate  an  extremely  large  sample  size. 

In  Sections  2 and  3,  where  0 is  assumed  known,  it  will  be 
convenient  to  take  o • 1 , so  6 * p/o  • p. 


2.  Confidence  Intervals 

This  section  attempts  to  quantify  by  means  of  confidence 
Intervals  one's  Intuitive  feeling  that  small  values  of  the  stopping 
rule  T are  evidence  in  favor  of  large  values  of  |0|.  It  should  be 
noted  that  with  minor  modifications  the  conceptual  framework  of  the 
first  part  of  this  section  may  be  adapted  to  various  stopping  rules  and 
parent  distributions.  However,  the  asymptotic  calculations  that  come 
later  rely  somewhat  more  heavily  on  the  definition  (1)  and  the  assump- 
tion of  normality.  See  also  Section  k. 

To  determine  confidence  Intervals  for  9 based  on  observing  T' 
and  S.J.,  (here  as  before  T'  = min  (T,ra)),  it  is  useful  to  define  several 
auxiliary  functions.  For  9 Y ^ » n » 1,2,...,  and  ]5|  < bvV  , let 


0^(n)  - lnf{0:  PgiT^n, 

s.p  •>  0}  ^ y}  , 

(2) 

©2(0  - lnf{0:  Pg{T<m, 

s ;>0}  + P {T>m,  s >C}  > y}  , 
r o m — — 

(3) 

0p(n)  - sup{9:  P0{Tj^n} 

+ P0{T<n,  s^<0}  > >}  , 

(A) 

and 

02(C)  --e2(-o  . (5) 

It  is  easy  to  see  from  continuity  and  monotonicity  considerations  that 

these  functions  are  well  defined  and  that  ^ < 9^  (1-1,2).  It  is  con- 

— 1 — 1 

venlent  to  assume  that  ^^(m)  2,  which  will  be  the  case  provided  that 
2y  > Pg{T  ^m},  the  significance  level  of  the  test  defined  In  Section  1, 
Let 


>8- 
' T 


0} 


,s_  < 

. .p 


0} 


(6) 


and 


A 


I 


e « e, (T)i . 


iT  < ni,s„ 


«2<V^T>.n) 


Then  ^^0  and  the  Interval  0j  is  a 100(1  -2)‘)X  confidence  Interval 
for  9.  The  proof  of  this  last  assertion  is  rather  tedious,  since  it 
involves  separate  consideration  of  several  cases.  It  is  deferred  to 
the  Appendix. 

Exact  computation  of  the  probabilities  involved  in  (2)~(4)  is  a 
rather  complicated  numerical  problem.  A fairlv  easy  asymptotic  analy- 
sis yields  what  appear  to  be  adequate  approximations  for  a wide  range 
of  parameter  values.  Most  of  the  rest  of  this  section  is  devoted  to 
describing  this  asymptotic  analysis  and  to  discussing  an  example. 

In  the  approximate  calculations  that  follow  it  is  assumed  that 
b -*  <»  and  m -*•  «“  in  such  a way  that  for  some  0^^  > 0 

b=*9j»'m.  (8) 


It  loJlcws  from  (1)  and  the  strong  Jaw  ot  large  numbers  thot  as  b -» 
for  all  0 > 0 

b/T  ~ S.J,  ' 0T  and  hence  T ~ (b/0)^  a.s.  (9) 


Thus  by  (8) 

0 for  1 0 I 0 1 

P-{T<mt  -*  ^ , (10) 

1 tor  (0|  ^ 0^ 


so  defines  asymptotically  those  values  of  0 for  which  the  experiment 


Is  terminated  before  the  m — observation. 


The  basic  theoretical  result  of  this  paper  is  summarized  In 


Proposition  1. 

P.{T  <m,b»^-x<s 
6 + — — m 


Under  condition  (8),  for  all  9 and  0<x<'»,  asb-^® 
<b/n}'m”S(^^(e-O^))/Q’'p_jj0  {T(y)<®}exp{(9^-0)y}dy , (i2) 


where  T(x)  = inf{n;  s^ > x}  and  $ denotes  the  standard  normal  density 
function.  For  0 > 0,  ^ 9^^  (12)  also  holds  when  x = ®. 

A heuristic  argument  in  support  of  Proposition  1 Is  implicit  in 
Section  3 of  Slegmund  (1977).  A proof  using  a somewhat  different 
approach  Is  contained  In  the  Appendix.  Several  of  the  Important  Ideas 
go  back  to  Anscombe  (1953). 

Of  course,  for  use  In  calculating  probabilities  It  is  still 
necessary  to  evaluate  the  integral  on  the  right  hand  side  of  (12).  For 
x “ “ and  9 > 0^  Slegmund  (1977)  observed  that  random  walk  theory 
allows  one  to  rewrite  this  Integral  In  a form  suited  for  easy  machine 
computation;  and,  in  fact,  the  same  formula  Is  valid  for  x “ ® and 
0 < 0 < 0^  also. 

By  standard  reasoning  for  0 > 0 as  x -*  ® 

P 0(t(x)<®}  ~ exp(- 2 0x)  (P_g{T (0)  = oo})  ^/20^  (e.g,,  Slegmund,  1975). 

Random  walk  theory  yields  a computationally  convenient  evaluation  of 
P g{T(0)“®}  (Feller,  1966,  p.  395)  leading  to  the  asymptotic  relation 

P_jje  {t(x)<®}  - v(0^)  exp(-6^x)  (x-»®)  , (13) 

where  In  terms  of  the  standard  normal  distribution  function  ^ 

v(0)  - 2 exp{-2  n‘^  <K-  ■j9n‘*)}/0^  . (lA) 

Table  1 gives  selected  values  of  V. 
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TABM',  ] 
v(0) 


0 

V(0) 

. 10 

.9435 

.15 

.9164 

.20 

.8901 

.25 

.8645 

.30 

.8397 

.35 

.8157 

.40 

.7923 

.45 

.7696 

.50 

. 74  76 

.55 

. 7263 

.60 

.7055 

.65 

.6854 

.70 

.6659 

.75 

.6470 

.80 

.6286 

.85 

.6107 

.90 

.59  34 

.95 

.5767 

1.00 

.5604 
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Numerical  evidence,  some  of  which  is  mentioned  in  Siegmund 
(1975),  suggests  that  the  convergence  indicated  by  (13)  is  very  fast, 
and  hence  for  0 < x < “°  a reasonable  apprcxi mation  to  the  integral  on 

X .CO  ,.oo 

the  right  in  (12)  is  obtained  by  first  writing  ~ ~ then 

00 

pretending  that  (13)  is  an  equality  to  evaluate  . For  0 £ 0 and 
fairly  small  values  of  x it  is  probably  reasonably  accurate  to  substi- 
tute the  right  hand  side  of  (13)  directly  into  the  integrand  on  the 
right  in  (12).  Actually,  even  for  0 > 0 and  x = “,  when  exact  evalua- 
tion of  the  integral  on  the  right  hand  side  of  (12)  is  possible,  using 
(13)  and  the  tabled  values  of  (14)  to  obtain  an  approximate  evaluation 
of  (12)  seems  to  be  accurate  enough  for  most  practical  purposes. 

With  the  aid  of  Proposition  1 it  is  possible  to  give  good 
approximations  to  the  probabilities  appearing  in  (2)-(4)  for  most 
parameter  values  of  interest.  For  example,  for  0 < 0 < 0^ 

P.{T<m,  s >0}  - PqIT  <m}  » Pots  > b/ml  + P^{T  < m,  s < b/m}  ; (15) 
u ~ 1 0 T — o m — o T — m 

for  0 > 0 and  x > 0 

P.{T<'m,s_>0}  + P„(T>m,s  >b/m-x}  “ P„{s  >b/m-x}  + P.fT  < m, s <b/in-x}  ; (16) 
y “ 1 y m y nt^  y + m 

and  for  0 > 0^^ 

P.{T>m}  + P-{T  < m,s  <0}  ~ P„{T  > m)  = P.{s  '■  b^i}  - PoiT  < m,s  < b/ii} . (17) 
o 0—  T 0+  Om  0+  m 

The  first  terms  on  the  right  hand  sides  of  (15)-(17)  are  obtained 
exactly  in  terms  of  the  normal  distribution,  and  the  second  terms 
may  be  evaluated  approximately  using  Proposition  1.  For  n m scaled 
for  asymptotic  purposes  by  the  relation 

n “ mr  for  some  0 < r < 1 , (18) 
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ipiiroximat  lr>ns  to  Pf^{T^n.  s^.  O},  etc.  mav  be  obtained  from  similar 

formulae  but  with  0^  replaced  by  P^/r.  The  approximation  Kiven  by  (17) 
and  Proposition  1 Is  essentially  that  sup,y,ested  in  Siepmund  ( 1977)  for 
the  type  II  error  probability  of  the  test  defined  in  Section  1.  It  was 
shown  to  be  quite  accurate  for  computational  purposes  by  comparing  it 
with  exact  numerical  results  of  McPherson  and  Armltage  (1971) — see  also 
Armitage  (1975).  The  author  has  performed  a small  simulation  to  check 
the  accuracy  of  the  approximation  suggested  bv  (15)  and  Proposition  1. 
Although  this  approximation  does  not  seem  to  be  quite  as  good  as  that 
given  by  (17),  its  accuracy  to  within  about  seems  adequate  for 

practical  purposes. 

Table  2 gives  some  examples  of  approximate  90%  confidence 
intervals  for  6 computed  from  hypothetical  data.  The  test  parameters 
are  b = 3.45  and  m = 148,  which  give  a significance  level  at  = .01  and 
power  1-6  = .95  at  9 = .4  (Armltage,  1975,  p.  105).  The  data  were 
selectei  to  correspond  roughly  to  "typical"  outcomes  for  9 = .8,  .b, 

.4,  .27,  and  .2.  For  example,  the  value  T = 19  in  the  first  row  is  the 

nearest  Integer  to  EqT  for  9 = .8.  (According  to  simulations  and  the 

o 

asymptotic  approximations  of  Siegmund,  1977,  E^T  = 18.8  for  9 = .8.) 

The  columns  headed  "Probability"  give  approximately  the  probability  of 
obtaining  a more  extreme  value  than  the  data  in  the  first  column  for 
the  value  0 equal  to  Che  one-sided  confidence  limit  in  the  preceding 
column.  Thus  the  first  row  of  the  third  column  gives  approximately 
P ^^{T^19,  s.j,>0},  while  the  corresponding  entry  in  the  fifth  column 
gives  approximately  Pj  "*■  Pj  s^<0}. 


TABLE  2 


90%  CONFIDENCE  INTERVALS 


Data 

0 

Probability 

0 

Probability 

T =>  19,  s^  > 0 

T 

.35 

.053 

1.15 

.049 

T » 32,  s,  > 0 

T 

.26 

.051 

.89 

.049 

T = 68,  s^  > 0 
’ T 

.17 

.048 

.60 

.050 

T > 148,  s^^g  = 40 

.11 

.048 

.40 

.046 

T > 148,  s^^g  = 30 

.06 

.047 

.34 

.050 

In  regard  to  the  heuristic  discussion  concerning  proportional 
accuracy  in  Section  1,  it  is  interesting  to  note  that  if  one  considers 
the  midpoint  6 = (^+9)/2  of  the  intervals  in  Table  2 as  a point  esti- 
mator of  0,  then  the  ratio  (0-^)/6  is  approximately  constant  (=  1.1) 
In  the  first  four  rows.  In  the  last  row  the  confidence  Interval  is 
essentially  the  same  as  the  standard  fixed  sample  size  Interval  based 
on  the  same  value  of 

There  is  an  apparent  discrepancy  between  the  confidence 
Intervals  given  in  Table  2 and  the  test  of  hypothesis  defined  in 
Section  1,  in  the  sense  that  for  the  data  in  the  last  two  rows  oi 
Table  2 the  Interval  (£,  0)  does  not  contain  0,  although  the  test  of 
Section  1 falls  to  relect  H.:  0 « 0.  Of  course,  the  reason 

j 0 

is  that  Table  2 gives  90%  confidence  Intervals  whereas  the 
significance  level  of  the  test  is  a “ .01.  These  choices  were  made  to 
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provide  an  excuse  for  the  following  discussion. 

For  ly  > PQ{'r£m)  a te.st  which  reiects  6 = 0 If  and  onlv  If 

the  (1-2y)  100%  confidence  interval  defined  above  fails  to  contain  0 

has  as  its  reiection  region  {T  < m}  H{T>Tn,  js  I where 

— m — 

0 < ^ < bi4i  is  chosen  so  that 

2y  = P„{T  < m}  + P-{T  >m,  !s  | > f}  . 

Along  with  its  larger  type  I error  probability  this  test  has  a smaller 
type  II  error  probability  than  the  test  of  Section  1.  It  has  some 
additional  flexibility  which  suggests  that  it  should  be  studied  in  its 
own  right.  Since  this  test  is  defined  by  three  parameters  b,  m,  and 
it  is  possible  to  impose  another  constraint  in  addition  to  the  usual 
determination  of  a and  B.  For  example,  b may  be  Increased  to  provide 
for  more  accurate  estimation  of  6.  Another  oosslbllity  is  to  choose 
a smaller  value  of  m than  the  test  of  Section  1 requires,  so  that  the 
maximum  sample  size  of  the  sequential  test  is  not  so  much  larger  than 
that  of  the  fixed  sample  size  test  of  the  same  a and  B.  Both  of  these 
modifications  tend  to  increase  the  expected  sample  size  for  large  (0| 
and  must  be  evaluated  accordingly. 
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The  methods  of  this  section  may  also  be  used  to  obtain  point 
estimators  of  0.  The  obvious  sugRestlon  Is  the  midpoint  (^+6)/2  of 
the  confidence  Interval  [0^,  0]  for  some  confidence  coefficient 
(1-2y)  100%.  The  midpoint  of  the  0%  interval  Is  perhaps  the  most 
natural.  For  the  data  of  Table  2 the  midpoints  of  the  0%  intervals  are 
essentially  the  same  as  the  midpoints  of  the  90%  intervals.  It  is  not 
known  whether  this  is  true  generally. 

3.  Estimators  Based  on  x^, 

The  estimators  of  the  preceding  section  are  unusual  in  the 
sense  that  when  T ^ m they  are  defined  directly  in  terms  of  T and  only 
indirectly  in  terms  of  = s^/T.  This  has  Important  implications  when 
a is  unknown,  for  then  analogous  arguments  give  estimators  for  the 
parameter  0 * y/o  and  not  p alone — see  Section  A.  In  this  section 
simple  estimators  based  on  , are  suggested  and  some  of  their  proper- 
ties studied,  largely  by  simulation. 

The  naive  estimator  x^,  is  rather  badly  biased  when  |0|  is 
large,  hut  this  bias  is  easily  approximated  and  may  be  reduced  con- 
siderably by  a simple  modification.  To  compute  the  asymptotic  bias  of 
X.J, , as  b and  m -*•  “=  so  that  (8)  holds,  observe  that 

S.J./T  - b//T  + (s^-b/r)/T  - b//f  + (s^/T-b^}/(s.j,  + b/f)  . (19) 

From  (9)  it  follows  that 

8.J  + bv¥  ~ 2b^/0  a. 8.  (0?‘O)  . (20) 
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A Taylor  series  expansion  gives 


b/v'T  = 0 - 6^(T  - b^/e^)/2b^  + 30^(T  - b^/fl^)^/8b^  + ...  . (21) 

The  following  asymptotic  results  have  been  obtained  by  Lai  and  Siegmund 

2 

(1977,  unpublished)  and  Woodroofe  (1976a)  in  terms  of  = 9s^  - n6  /2 

and  =*  inf{n;  S >0}: 

+ n 

EqT  » (b^-i)/e^  + > 


2 0 2 

and  E^(s_/T-b  ) E^S  /2  E^S  as  b > ®.  Substituting  these 
D T O T_^  O 

results  into  (19)  and  (21)  and  appealing  to  (20)  yields  for  9 5^  0 

E.(s.,/T)  = e(l  + 2/b^)  + o(b^)  (b-“)  . (22) 

U 1 

It  follows  heurlstlcally  from  (10)  and  (22)  and  rigorously  after  some 
additional  calculation  that 

EqX^,  = 9(H-2/b^)  + o(b‘^  for  |e!  > 0^ 


» 9 + o(b  ^) 


for  |e|  < 0^ 


This  suggests  estimating  0 by 


9 - (s^/T)/(l  + 2/b^) 


« s /m 
m 


if  T < m 

if  T > m 


(23) 


so  that  E.0  - 0 + o(b*’^)  for  all  l9|  ^ 0,.  Obviously  this  estimator 
should  be  altered  further  to  avoid  the  embarrassing  possibility  of 
estimating  |0|  to  be  larger  when  T > m and  |s^|  is  close  to  b»4i  than 
when  T - m.  It  Is  doubtful  that  such  a refinement  would  significantly 
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alter  the  behavior  of  the  estimator  except  perhaps  for  |9|  close  to  0j^. 
In  any  case  the  author  has  made  no  effort  in  this  direction. 

A similar  calculation  shows  that 


Eg(e  - 0)^  ~ ~ l®i  i 

~ 1/m  I 6 I £ 


(24) 


According  to  (9)  and  a Theorem  of  Anscombe  (1952)  , as  b -* 

/r^(x.j,,-G)  has  asymptotically  a standard  normal  distribution.  Since 

b(0-x.j,,)  -*■  0 in  probability,  the  same  is  true  for  /T'(0-0).  Hence 

with  z defined  bv  1 - <I>(z  ) “ y the  interval 
Y ' Y 

[0  - z^/v^  , 0 + z^//r  ] (25) 

is  an  approximate  (i-2y)  lOOZ  confidence  Interval  for  9.  An  alterna- 
tive interval,  which  is  slightly  longer  when  T £ m,  and  which  empha- 
sizes the  manner  in  which  T provides  for  estimating  0 with  prescribed 
proportional  accuracy  is 

[0  - z^  max(  |x.j,,  I /b,  l//n)  , 0 + 2^  max(  |x.j.,  1 /b,  l/Zn)]  . (26) 

For  the  current  problem  of  estimating  a normal  mean  with  known  variance 
the  difference  between  (25)  and  (26)  is  practically  negligible.  For 
other  parametric  families  and  for  the  case  of  a normal  mean  with 
unknown  variance,  the  analogous  difference  may  be  important — see 
Section  4. 

Table  3 contains  the  results  of  a small  Monte  Carlo  experiment 
to  check  the  accuracy  of  the  preceding  asymptotic  analysis.  The  values 
chosen  for  b and  m were  again  3.45  and  148  respectively.  The  value  of 
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TABLE  3 


ESTIMATED  PROPERTIES  OF  6 OBTAINED  FROM 
A 200  REPETITION  MONTE  CARLO  EXPERIMENT 


0 

Eq0  t Standard  Error 

E^(e-0)^ 

Coverage 

Percentage 

80 

. 79 

.02 

.075 

.895 

60 

. 64 

+ 

.02 

.045 

.905 

40 

.43 

4 

.01 

.051 

.900 

27 

.30 

- 

.01 

.029 

.900 

20 

.21 

+ 

.01 

.915 

1 + 2/b  is  1.168,  so  the  recommended  bias  reducing  factor  decreases 
S.J./T  by  about  14%.  The  value  of  6^^  is  .284.  The  row  corresponding  to 
.27  is  of  particular  Interest,  for  in  a neighborhood  of  0 = 9^  the 
preceding  asymptotic  analysis  can  be  expected  to  yield  good  approxima- 
tions only  for  b quite  large.  (Actually  for  0 = 0j  the  analysis  given 

for  EqCs,,.,/T’)  breaRs  down . “'AlTemaTi've  calcniatlrms  v-teld  a bias  of- 
o I 

-2 

order  b , but  the  details  have  been  omitted.) 

The  figures  in  Table  1 suggest  that  0 has  a small  positive 
bias,  but  that  the  bias  is  much  smaller  than  that  of  the  unmodified 
estimator  s.j,,/T'.  The  mean  square  error  is  poorly  predicted  by  the 
asymptotic  theory.  However,  the  mean  square  error  of  the  unmodified 
estimator  s.j,,/T'  (which  is  not  reported  here)  seems  to  be  consistently 
about  50%  larger  than  for  0.  The  percentage  of  times  the  interval  (26) 
covered  the  true  parameter  0 is  remarkably  close  to  the  nominal  .90. 
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The  following  discussion  explores  the  relation  between  the 
approximate  confidence  intervals  given  by  (26)  and  those  of  the  pre- 
ceding section.  As  a first  step,  it  is  Interesting  to  compute  the 
Intervals  given  by  (26)  for  the  hypothetical  data  in  Table  2.  In  order 
to  do  this  for  the  data  of  the  first  three  rows,  it  is  necessary  to 
hypothesize  a value  for  s^  in  addition  tc  T.  But  s^  = b/F  -t-  (s^  - b/F) ; 
and  it  follows  from  results  of  Lai  and  Siegmund  (1977,  unpublished)  or 
Woodroofe  (1976a)  that  as  b E„(s_-bt^)  ->■  /26EqS  , where 

D 1 ° '^4.  ” "^4. 

2 ^ 

S ” 6s  - n0  /2  and  T = lnf{n:  S >0}.  It  may  be  shown  that 
n n + n 

2 

E„S  /20EqS  * .584  ...  + 0/8  + o(6)  as  6 -*•  0,  which  suggests  using 

® V ® V 

s^  = i-  .584  (27) 

as  a hypothetical  value  for  s^  as  a function  of  T.  Table  4 gives 
approximate  90%  confidence  intervals  for  6 for  the  hypothetical  data  of 
Table  1 with  s^  approximated  by  (27)  . The  intervals  In  Table  4 are 

TABLE  4 

APPROXIMATE  90%  CONFIDENCE  INTERVALS  FROM  (26) 


Data 

0 

Lower  Limit 

Upper  Limit 

T 

- 

19,  s^ 

• 

15.62 

. 704 

.31 

1.10 

T 

- 

32, 

’ T 

m 

20.1 

.538 

.24 

.84 

T 

- 

68,  8_ 

m 

29.0 

.366 

.16 

.57 

T 

> 

1^8.  Sue 

m 

40 

.270 

.13 

.41 

T 

> 

1^8. 

m 

30 

.203 

.07 

.34 
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about  the  same  length  as  those  in  Table  2.  For  data  T ^ m they  tend  to 
be  shifted  slightly  toward  smaller  values  of  lo],  which  is  a reflection 
of  the  fairly  substantial  bias  reducing  factor  which  goes  into  0. 

It  is  also  possible  to  give  a crude  analytic  approximation  to 
the  coverage  probability  of  (26).  It  Is  easy  to  see  that 

Pg{0  - 9 > z^|7^|/b,  s^>n}  = Pg{T<m((s^-bK¥)//r),  s^  > O)  , (28) 

where  m(x)  ” [0  ^{b(l  + 2/b^)  ^ - z^} { 1 + x/b}  ] ^ . If  the  argument  of  the 
function  m on  the  right  hand  side  of  (28)  were  not  random,  the  methods 
of  the  preceding  section  would  give  an  approximation  to  (28) . Since  T 
is  an  integer  valued  random  variable  and  very  small  changes  in  x 
do  not  change  the  integer  part  of  m(x) , it  does  not  seem  com- 
pletely outrageous  to  replace  the  random  variable  (s^-b/r)//r  by  an 
approximation  to  its  expectation.  Equation  (9)  and  the  discussion  pre- 
ceding (27)  suggest  considering 

P„{T<m(b"^0(. 584  + 9/8))}  (29) 

w — 

as  an  approximation  to  (28).  For  b = 3.45,  m = 148,  z^  = 1.645  and 
9 » .6  this  argument  yields  the  approximation 

Pq{0  - 0 > z^|x^|/b}  S .030  , (30) 

and  a similar  calculation  gives 

Pg{6-0<  -z^|x.j.|/b}  S .073  , (31) 


so  that  the  probability  that  interval  (26)  fails  to  cover  0 is  about 
.030  + .073  » .103.  The  corresponding  calculations  for  0 - .4  yield 


respectively  .03A,  .076,  and  .109.  Although  these  calculations  should 


not  be  taken  too  seriously.  It  is  Interesting  to  note  that  the  right 
tall  probability  In  (30)  Is  smaller  than  the  left  tall  probability  in 
(31),  which  Is  consistent  with  the  earlier  observation  that  the 
Intervals  (26)  are  shifted  towards  smaller  values  of  j9|  than  those  of 
Section  2.  Also  the  total  coverage  probability  seems  to  remain  reason- 
ably close  to  the  nominal  .90,  which  Is  consistent  with  the  Monte  Carlo 
results  presented  In  Table  3.  The  author  has  performed  a few  calcula- 
tions for  other  values  of  b and  9 and  obtained  results  which  are 
reasonably  consistent  with  those  reported  here. 

4.  Remarks  on  Other  Parametric  Models 

The  results  of  the  preceding  sections  should  extend  in  a fairly 
straightforward  manner  to  problems  involving  other  one  parameter 
exponential  models.  At  least  there  seem  to  be  no  conceptual 
difficulties,  although  the  technical  requirements  of  the  approximate 
calculations  of  Section  2 may  be  considerable.  The  important  case  of 
matched  pairs  of  Bernoulli  outcomes  may  be  reduced  to  a one  parameter 
model  by  the  customary  practice  of  discarding  success-sucess  and 
failure-failure  pairs  (Wald,  1947,  p,  107),  However,  numerical  compu- 
tation of  the  constants  which  enter  Into  the  asymptotic  formulas  of 
Section  2 may  not  be  appreciably  simpler  than  exact  numerical  computa- 
tion of  the  probabilities  from  the  difference  equations  they  satisfy. 

By  way  of  contrast  the  case  of  a normal  population  with  unknown 
mean  and  variance  presents  a new  conceptual  problem.  Now  assume  that 

are  Independent  and  normally  distributed  with  unknown  mean  U 
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2 

and  variance  o . SleRmund  (1977)  suggests  a stopping  rule  analogous  to 
(1)  for  testing  u = 0 against  H^:  u 0 defined  in  terms  of  the  log 
generalized  likelihood  ratio  statistic 


Z 

n 


= i n logd  + x^/v^) 
z n n 


Here  x = n ^ e!?  x, 
n Ik 

A. 

- 2, 3, . . . let  T 


niQ.  + 1... 


and  = n ^ -x  )^.  For  a > 0 and 

n 1 k n 

= first  n > m_  such  that  Z > a.  For 
— 0 n 

A A 

Stop  sampling  at  T’  = mln(T,m)  and  reject 


if  and 


only  if  T £ m.  An  asymptotic  approximation  for  the  significance  level 
of  this  test  as  m^  -►  <*>,  m -►  »,  and  a -*•  “>  in  such  a way  that  0^  = /Za/m^ 
and  » /2a/  m remain  fixed  is  given  by  Slegmund  (1977),  and  an  approx- 
imation to  the  power  may  also  be  obtained. 

An  attempt  to  adapt  the  estimation  methods  of  Sections  2 and  3 

leads  to  several  new  problems.  The  distribution  of  the  Z and  hence  of 

n 

A 

T depends  on  the  parameters  (u.o)  only  through  the  value  of  9 = \s/a, 
and  thus  the  method  of  Section  2 yields  confidence  Intervals  for  9 but 
not  for  u alone.  In  a clinical  trials  context  the  primary  goal  of 
experimentation  is  decision  oriented:  to  recommend  the  superior  treat- 
ment if  there  is  an  appreciable  difference  in  their  effects;  and  a con- 
fidence Interval  provides  a more  useful  way  of  measuring  this  differ- 
ence than  a simple  test  of  hypothesis.  However,  the  exact  parameter 
used  to  measure  this  difference  seems  not  to  be  terribly  Important,  and 
for  this  reason  a confidence  Interval  for  0 may  be  as  useful  as  one  for 
the  customary  p.  The  computations  required  to  give  confidence 
Intervals  for  0 by  the  method  of  Section  2 are  quite  complicated  and 
will  be  considered  in  a future  publication. 
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Estimation  of  9 based  on  may  be  carried  out  in  much  the 

same  fashion  as  in  Section  3. 

In  principle  p may  be  estimated  by  x^,,  perhaps  adjusted  for 
bias,  and  approximate  confidence  Intervals  analogous  to  (25)  or  (26) 
may  be  given.  However,  it  is  not  true  in  general  that  small  values  of 
T are  evidence  in  favor  of  large  values  of  j jj  | (although  they  are 
evidence  of  large  values  of  |8|).  Two  new  facts  make  this  line  of 
attack  more  complicated  than  in  the  case  of  known  a. 

Computation  of  the  asymptotic  bias  of  x^  seems  much  more  diffi- 
cult than  In  the  case  of  known  a.  Presumably  the  bias  Is  smaller  than 

before  because  small  values  of  T may  be  caused  by  large  values  of  x or 

n 

2 

small  values  of  v^  or  both.  A small  Monte  Carlo  study  not  reported  In 
detail  suggests  that  some  adjustment  Is  advisable  but  that  the  adjust- 
ment used  in  the  case  of  known  O Is  slightly  too  large. 

Given  a satisfactory  point  estimator  M for  p,  analogous  confi- 
dence Intervals  corresponding  to  (25)  and  (26)  are  respectively 

[u-z^v-,/*^  , p+z^v-,/v^]  (32) 

and 

fu  - z^max(|x;^,  |/i/2a  , v^/^m) , p + z^max(  (x",  | //2a  , v^//n)]  . (33) 

The  second  interval  has  the  interpretation  of  estimating  p with  pre- 
scribed proportional  accuracy  whenever  T ^ m.  It  Is  longer  than  the 
first  interval,  and  perhaps  considerably  longer,  by  virtue  of  the 


inequalities 


Determination  of  a reasonable  estimator  y and  of  the  relative  merits  of 
(33)  and  (34)  awaits  further  study. 


Appendix 


Proof  That  [^,  6]  Has  Required  Coverage  Probability 

The  notation  and  definitions  are  those  of  Section  2.  It 

* 

suffices  to  show  for  all  6 that  P0{^^9}  That  P^{d  < 9}  < y is 

proved  similarly  and  these  two  statements  together  imply  that 
Pq{^_^6^9}  ^ 1 ~ 2y . There  are  three  cases  to  consider;  (I) 

^^(m)  £ 0,  (II)  ^j^(m)  > 6 and  £ 9*  (HD  ^ ® 

necessarily  ^^(m)  >0).  In  case  I P0iT_^m,  s^ > 0}  ^ > and  hence 

Pq{0>9}  = PgfTjCm,  s^>0,  0^(T)>9} 

= P„{T<n,  s >0},  where  9,(n)>0>0fn+l)  , 

0—1  —1  1 

=•  y 

In  case  II  P.{T  < m,  s > 0}  < y < P.{T<m,  s > O}  + P-{T>m}  and  hence 
0—  I — 0—  T 0 

P„{0  > 0}  - P„{T  < m,  s_  - 0}  + P„{T  >m,  0,(s  ) > 0} 
y — y — 1 y — z m 

• P„{T  ^ m,  s >0}  t-  P„{T>m,  s >0,  where  * 9 , 

0—1  0 m —i 

• Y 

Case  III  is  treated  similarly  and  is  omitted. 

Proof  of  Proposition  1. 

The  following  argument  gives  an  asymptotic  upper  bound  for  the 
probability  on  the  left  hand  side  of  (12).  The  corresponding  lower 
bound  is  obtained  by  a similar  but  considerably  easier  argument.  For 
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k.  = 0,1,. ..,m  and  0 < x < since  Is  sufficient  for  0 and  (8)  holds 

P.{T  m,  bvm  - X s < bv^}  < P^fT.  ' m-k,  b/n-x^s  < b/ii} 

0+  — m — 0+  — m 

+ m ^(f(v'm(6,  ~ 0)^  >bv'^  for  some.  m-k<n<m|s  =0m-v}  (34) 

1 0 n —'ml 

• expl (6^  ” 0)y  “ y^/2m}dy 


Lemma  1 . For  fixed  k = 0,1,...,  and  0 < v < °" 

I’  {s  > bv'ri  for  some  m-k<n<m|s  = b>4n  - y)  ^ i n (v)  < k) 

On  — m — 

Proof . The  conditional  probabilitv  above  mav  be  rewritten 

P {s  -s  <01411 -b/n-v  for  some  m-k<n<m|s  =bi4n-y} 

0 m n - m 


= P„{s  , < bv4n  - b/m  - i - y for  some  0<i<k|s  =6,  m-v) 
Of  —ml 


It  is  easily  verified  bv  straiRhtf orward  calculation  that 

j 1 3/2  1. 

b»'m  - b/m  - i = ■jm"'^  bi  + n(m~  '^b)  -*-j0ji  and  the  iolnt  density  of 

X ,...,x  given  s = m 0,  - v converges  to  that  of  independent  normal 
11  ml 

random  variables  with  mean  0j  and  variance  1.  The  lemma  follows  easily 
Lemma  2 . For  0 < 9 < 0^  and  fixed  k = 0,1,..., 


Pg{T_^<m-k}  £ ^ <t>(  (0j^  - 6)  i^Ti)  , 


where  EoCk)  does  not  depend  on  m and  converges  to  0 as  k -►  «> 
rJ 


Proof . 


P„{T,<.m-k}<  I P„{s  >0^17} 
0 + - ^ , 0 n 

n<m-k 


m-1 

F-  [l  - ${(0  -0)i4ii+0(*4n  - /m-  1)}]  . 
1-k+l 
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T-  1 1 -1 

The  inequal  itles  t'm  - Vm  ~ i ^ — m i and  ]-'J>(x)_<x  (fCx)  (x^O) 
yield  the  required  result  with 

00 

e (k)  = (0  -6)  ^ I expi-^OCB,  - 9)1  } 

® ^ l»k+l  ^ ^ 

Lemma  3 . For  each  9,  k = 0,1,...,  and  0 < x < “> 

^ 2 — ^ r- 

Pq{T,  < m - k,  b/n  - x ^ s < bv'm}  < EoCk)  exp(x  / 2m)m  ili(v^(0,  - 9) ) , 

o + — m ~ 0 1 

where  E»(k)  does  not  depend  on  m and  converges  to  0 as  k ♦ °°.  For 

U 

9 > 0,  E^Ck)  may  be  chosen  to  be  independent  of  x. 

U 

Proof . For  0 < 6 < 0^  this  result  is  already  implied  by  Lemma  2.  By 
(8)  and  sufficiency  of  s 

m 

P.{T<m-k,  b>^i-x<s  < b/ri} 

0 + ~ m 

(35) 

= m ^'l)(fm(6,  - 9))  Pn'lT  <-m-k|s  =b/n  - vlexp{  (6,  - 6)y-y^/2m}dy  . 
I U u *t-  m i 

Setting  6 = y < 9j^  in  (35)  gives 

P^{T^<  m-k}^m  ^ (])(/n(6j^  - r)  )exp(-  x^/  2m)  P^{T^  < m - k | s^  = bi4n  - y }dy 

and  hence  by  Lemma  2 

L Pn{T  <m-k|s  “b»^-y}dy  < exp(x  /2m)  mf  E.,,(k.) 

0 0+  m - 0<Y<ej^  ' 

Substituting  this  inequality  back  into  (35)  gives  the  desired  result. 

That  the  right  hand  side  of  (12)  is  asymptotically  an  upper 
bound  for  the  left  hand  side  follows  at  once  from  (34),  Lemma  1,  and 
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Lemma  3 In  the  case  x < “. 


For  the  case  x = «>,  for  arbitrary  finite  x' 


P-{T  <m,  s < bi/m}  = Pq{T^  < m,  bv^-x'<s  < bv^} 
y + m y + m 

(36) 

+ PqIT,  <m,  s <b/Ti“x'} 

W + in 


For  0 > 

PQ{T<m,  s <bv^-x'}<PQ{s  <m0  -x'}  = ${/n(0.  - 9)  - x'//n} 
y+m  — yml  1 

£ exp{  (6^  - 6)x' } (^(✓m(0^  - 6))//n(0  - 6j^)  , 


so  the  desired  result  follows  from  the  case  of  finite  x by  first 
letting  m -*■  <»  and  then  x'  “ in  (36).  The  required  estimate  for  the 
second  probability  on  the  right  hand  side  of  (36)  in  the  case 
0 < 0 < 9j^  is  provided  by  Lemma  2 and  the  following  result. 

Lemma  4 . For  0 < 0 < 0^,  k = 0,1,...,  and  0 < x < ® for  all  m 
sufficiently  large 

Pq {m  - k < < m,  s <bi'm-x  } < 6(x)  - 9))/*^  , 

y — + m “ 1 

where  6(x)  ->•  0 as  x 

Proof . The  proof  follows  easily  from  the  argument  of  Lemma  2 and  the 
inequality 


P^fm  - k < T < m,  s < bi4i  - x}  < Pq{T  = n}PQ{s  -s  <b/n-bi/n-x} 

0 -+  m 0mn 

m-k<n<m 


< E {1 -<I>(b-9»'n^)}4>((0  -0)/i-x//T) 
0<l£k  ^ 
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